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MORE ECONOMICAL RESOURCE APPUCATION ON THE USER ' 
INTERACTION WITH A SPEECH DIALOGUE SYSTEM IN A PACKET 
NETWORK BY MEANS OF A SIMPLIFYING PROCESSING OF iSIGNALLING 
INFORMATION 

CROSS REFERENCE TO RELATED APPLICATION 
[0001] This application is the US National Stage of International Application No. 
PCT/EP2004/05 1 128, filed June 16, 2004 and claims the benefit thereof. The 
International Application claims the benefits of German Patent application No. 103 27 
290.9 DE filed June 17, 2003, both of the applications are incorporated by reference 
herein in their entirety. 

FIELD OF THE INVENTION 
[0002] The invention relates to methods and devices for a simplifying processing of 
signaling information during a dialogue with a speech dialogue system in a packet 
network. 



BACKGROUND OF THE INVENTION 
10003] One of the most important current developments affecting the fields of network 
technologies, of call processing, and of Internet technologies, is the realization of services 
with real-time transmission via packet networks. 



[0004] At the present moment, most speech transmission is handled via line switched 
networks - also knovra as TDM (time division multiplexing) networks. The aim for the 
future is to transmit a greater amount of speech via packet oriented networks which are 
currently used mainly for data transmission. Hereby, the so called IP (Internet protocol) 
networks are the most important class of packet networks. In addition, in future there will 
be fiuther transmission capacity intensive real-time services, such as, for example, the 
transmission of video data during a video-on-demand service. 

[0005] An important class of real-time services is the automated provision of speech 
or video information. One example of this type of service is given by the recorded 
announcement services known fi-om TDM networks, e.g. telephone number 
announcements („the telephone number of the subscriber is ...") or error messages („the 
number you are trying to call is not available"). Thereby, automated information output 
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can also contain subscriber specific information (e.g. telephone numbers). Dialogue 
ftmctionality is an extension of the announcement functionality. There the user can control 
the service or the dialogue by using the keys in his terminal device or by means of speech 
input. Servers are used to achieve such kinds of services in packet networks. In the case of 
interactive services, the term IVR (interactive voice response) server is commonly used. 
A number of coding methods or codecs (coder-decoder), such as, for example, G.711A/u, 
G.723.1, G.726, G.728 and G.729A/B were standardized for the transmission of speech. 
Standards H.261 and H.263, for example, are used for the transmission of video 
information. For an information output, usually a codec or coding method that is 
supported by both ends of the network is selected for the information transmission in a so 
called codec negotiation. 

[0006] For services with real-time transmission via data networks, it is essential that 
the service characteristics known fix>m the TDM network be provided for corresponding 
or new services with comparable quality and efficiency. The optimization of the resource 
application plays an important part in this. 

SUMMARY OF THE INVENTION 
[0007] The task of the invention is to improve the efiGciency of the resource 
application in the automated information output. 

[0008] The invention is based on the following consideration. The signaling with 
relation to an interactive dialogue with a speech dialogue system or an IVR (Interactive 
Voice Response) server, e.g. for an ou^ut of information, is usually carried out using 
DTMF signals (DTMF: Dual Tone Multiple Frequency). With this signaling - also 
frequently called tone dialing or dual tone frequency dialing - an interaction between the 
subscriber and the speech dialogue system is realized by means of an exchange of coded 
information through frequencies. Three scenarios can be distinguished in the transmission 
of DMFT signals via a packet network: 

[0009] • The DMFT signals are contained in the payload stream. In this connection, 
one also talks about in-band transmission. In-band transmission is only used in 
conjunction with non-compressing coding methods or codecs such as, for example, G.711. 
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[0010] • DMFT signals and payload are transmitted separately, i.e. out-of-band 

transmission of DTMF signaling information is carried out. 

[001 IJ • Die DMFT signals are transmitted in the payload stream in separately labeled 
data packets. A transmission of this type was standardized by the IETF in the Request for 
Comments RFC 2833 for the RTP (real-time protocol) packet format. 

[0012] With in-band transmission of DTMF signals, usually special hardware 
resources, for example, designed with DSPs (DSP: digital signaling processor) or ASICs 
(ASIC: Application specific integrated circuit) are required in the speech dialogue system 
or the IVR server for the analysis of the DTMF signals. With relation to the invention, in- 
band transmission of DTMF signals is largely avoided and the use of speech dialogue 
systems or IVR servers without hardware resources for the recognition of DTMF signaling 
is proposed. 

[0013] The coding method and the type of exchange of DTMF signals for an 
automated information output is usually determined during a so called codec negotiation 
between packet network terminals. The first packet network terminal is represented, for 
example, by a network interface device or a media gateway or by a packet based terminal 
linked directly to the packet network. The second packet network terminal is the speech 
dialogue system. In the codec negotiation a codec supported by both teiminals and by the 
network is selected fix>m a list of codecs. Usually, when a codec is selected, by de&ult or 
by presetting, the type of transmission of the DTMF signals is also determined, e.g. the 
selection of the coding method G.71 1 is linked to in-band transmission of the DTMF 
signals. With relation to the invention, two methods will be shown to exclude in-band 
transmission! 

[0014] In the first method, only out-of-band-signaling or signaling by means of 
specially labeled data packets is permitted in the codec negotiation. Coding methods that 
involve an in-band signaling of DTMF signals are practically eliminated from the list of 
applicable codecs during the codec negotiation. 

[0015] The second method includes an extension of the logic of the control device 
controlling the speech dialogue system. The control device (e.g. a packet based exchange. 



2003P08761WOUS Substitute Specification CKM 



Attorney Docket No. 2003P08761 WOUS 



a call server, a proxy server or a soft switch) is embodied such that said control device 
signals to the remote packet network terminal involved in the codec negotiation to use out- 
of-band-signaling of DTMF signals independently of the selected codec. In this case, as a 
rule the codec G.71 1 can also be selected. 

[0016] The invention has the advantage that speech dialogue systems can be used 
without complex hardware resources. In principle, for services with user interaction or 
automated information output, it is then also possible to use so. called general purpose 
platforms, that is multi-function computers with open inter&ces that provide the desired 
rVR or information output resources through their software tools. The provision of 
dedicated hardware is no longer necessary. 

[0017] According to a flirther embodiment - as a backup so to speak - provision is 
made for the rerouting of the job to a speech dialogue system with dedicated hardware for 
those cases where the in-band signaling of DTMF signals cannot be achieved with the 
above procedure. That would be the case in the first method if no codec supported by the 
two packet network terminals can be identified with out-of-band signaling and there is 
also no provision for signaling by means of a packet specifically provided for DTMF 
signals. In this case, accordmg to the fiirther embodiment, the service is switched to a 
speech dialogue system with dedicated hardware. With the second method, in the course 
of which, a control device stipulates out-of-band-signaling for the first packet network 
terminal independently of the codec selected, it is possible to forward the call to the 
speech dialogue system with hardware for DTMF signal recognition if the necessary 
resources or technical support for the out-of-band transmission are not available. 

[0018] The further embodiment allows a service to be dealt with also in cases where it 
is not possible to carry out a service with the speech dialogue system without special 
hardware. Otherwise, service requirements of that type would have to be refused. As a 
rule, however, it will be possible to provide the service through the speech dialogue 
system withoiit special hardware. Therefore, the provision of one backup speech dialogue 
system will be sufficient for a large number of speech dialogue system without special 
hardware. Alternatively, the speech dialogue system with special hardware can be 
assigned only to one speech dialogue system without dedicated hardwaire, whereby the 
speech dialogue system without special hardware is accordingly more powerfully sized, 
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i.e. with respect to the available resources it is designed for the processing of a lot more 
service requirements per time imit than the other speech dialogue system. 

BRIEF DESCRIPTION OF THE DRAWING 
10019] In the follovving, the subject of the invention will be described in more detail 
below with reference to an embodiment using a figure. 

[0020] Figure 1 - figure 1 is an illustration of the preferred embodiment of the 
invention. 

DETAILED DESCRIPTION OF THE INVENTION 
[0021] There are two IVR servers, IVR 1 and rVR2, represented in the figure, with the 
first IVR server IVRl having no special hardware for the processiog of DTMF signals, the 
second IVR server, however, does. Subscribers TLN are represented that are connected to 
a Ti)M network PSTN/ISDN. Voice communications of fte subscriber TLN can be 
switched via a switching system switch. The TDM network ISDN/PSTN is connected 
with an IP network IPNet by means of a media gateway GW. Here, this is, for example, a 
core network. Within the .IP network IPNet, signaling information and payload ND are 
routed separately. Signaling information coming from the TDM network IDSN/PSTN is 
transferred via a so called signaling transfer point STP to a call server CS2. Signaling is 
carried out with the help of signals of the SS7 signaling system. Using the MGCP protocol 
(Media gateway Control Protocol), the call server CS2 exchanges signaling information 
with the gateway GW and the speech dialogue systems IVRl and IVR2. The H,248 
protocol could also be used instead of the MGCP protocol. With the help of the MGCP 
(Media gateway Control Protocol) protocol, the call server CSl can also control the 
speech dialogue system IVRl. 

[0022] According to the invention, the selection of a codecs is carried out in the course 
of an interactive dialogue with a subscriber TLN as follows: 

[0023] With relation to the codec negotiation, a type of DTMF signaling without in- 
band signaling is required through the selection of the coding method, which DTMF 
signaling occurs in the packet network by means of signaling between the IP terminals and 
the call server CS2. This procedure executes in such a way that the A-side IP terminal, 
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represented by the gateway GW, signals a prioritized list of voice codecs, fax and tone 
capabilities to the call server CS2 during the call set-up. In compliance with admihistrable 
defaults, the call server CS2 can delete from the list the voice codecs/ capabilities that 
should not be used in the network, or said call server can change the priorities. The 
modified list is delivered to the B-side IP terminal, in this case the speech dialogue system 
rVRl. This system compares the list received via the signaling list with its own list and 
eliminates the voice codecs/ capabilities that are not contained in both lists. The list thus 
checked and possibly modified is signaled back to the A-side IP terminal via the call 
server CS2, and sets the selection of voice codecs/ capabilities that are to be used. 

[0024] During the codec negotiation the speech dialogue system IVRl only offers the 
signaling to RFC2833 and compressing voice codecs, which do not permit an in-band 
DTMF signaling and therefore inevitably result in a DTMF out-of-band signaling. That 
means that in special the usually used, but DTMF transparent regression coders (e.g. 
G.71 1) are not contained in the codec list signaled back by the IVR. The demand for the 
DTMF out-of-band signaling is made by the call server CS2 by means of signaling to the 
A-side IP terminal. To this end the call server CS2 has a logic that checks the voice codec 
negotiated by the codec negotiation. If this is a compressing voice codec (e.g. G.723), the 
call server signals the DTMF out-of-band transmission to the A-side IP terminal. 

[0025] There is a peripheral device with virtual annoimcement and/or dialogue ports in 
the call server CS2. The speech dialogue system IVRl and possibly also the speech 
dialogue system IVR2 are controlled via said peripheral device. This peripheral device 
with virtual announcement and/or dialogue ports converts the seizures of its ports by the 
call server CS2 into seizure signaling of the allocated ports leading to the speech dialogue 
systems. This peripheral device also outputs the jobs to play announcements and dialogues 
towards the speech dialogue systems. Acknowledgements fiom the speech dialogue 
systems IVRl or IVR2 indicating the end of the announcement or containing the input of 
the end user ensue at the assigned peripheral device with virtual annoimcement and/or 
dialogue port. All the signaling between the peripheral device responsible for the virtual 
announcement and/or dialogue port and the assigned external speech dialogue systems 
IVRl or rVR2 ensues via the signaling protocol MGCP that is used to access the media 
gateway MG. 
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[0026] For the case that the A-side IP terminal does not support any of the voice 
codecs offered by the IVRl that lead to DTMF out-of-band signaling, or RFC2833, the 
service requirement is automatically rerouted or forwarded to an alternative speech 
dialogue system rVR2 that also supports the voice codecs with in-band DTMF signaling. 
Since, as a rule, there is only a small number of IP terminals that exclusively support voice 
codecs with in-band DTMF signaling, the channel number of the speech dialogue system 
IVR2 can be substantially smaller than that of the speech dialogue system IVRl, thus 
achieving an optimization of costs for the overall IVR fimctionality to be made available. 

[0027] The codec negotiation that takes place during the call set-up is used as a trigger 
event for the rerouting of the speech dialogue system IVRl to the speech dialogue system 
rVR2. The speech dialogue system IVRl determines on the basis of the codec negotiation 
that there is no match between the voice codecs of the A-side IP terminal and of the 
speech dialogue system IVRl, and signals a corresponding error (e.g. error code 543 
„Codec Negotiation Error") to the peripheral device with virtual announcement and/or 
dialogue port in the call server. The peripheral device evaluates this error and, by means of 
the data link control in the call server, initiates hereupon a removal of the connection to 
the speech dialogue system IVRl, followed by a setting up of the connection to the speech 
dialogue system rVR2. The connection to the A-side IP terminal is maintained during this 
rerouting procedure. The addresses of the speech dialogue systems IVRl and IVR2 are 
administered in the database of the call server CS2. 

[0028] It is also conceivable that the speech dialogue system with special hardware 
IVR2 is controlled by a different call server from the call server CS2, e.g. by the call 
server CSl. In this case, the rerouting from the speech dialogue system IVRl to the speech 
dialogue system IVR2 can be achieved by exchanging appropriate signaling information 
between the two call servers CS2 and CSl e.g. by means of the BICC (Bearer ladependent 
Call Control) protocol. 

[0029] An alternative procedure is based on the extension of the logic in the call 
server CS2 to require the DTMF out-of-band signaling. This extension is that the logic 
checks whether the B-side IP terminal is the speech dialogue system IVRl. In this case the 
speech dialogue system IVRl also offers the non-compressing voice codecs, and the call 
server CS2, independently of the selected voice codec, always signals DTMF out-of-band 
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transmission to the A-side IP tenninal. 
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